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Abstract 

We combine evolutionary biology and community ecology to test whether two species traits, body size and geographic 
range, explain long term variation in local scale freshwater stream fish assemblages. Body size and geographic range are 
expected to influence several aspects offish ecology, via relationships with niche breadth, dispersal, and abundance. These 
traits are expected to scale inversely with niche breadth or current abundance, and to scale directly with dispersal potential. 
However, their utility to explain long term temporal patterns in local scale abundance is not known. Comparative methods 
employing an existing molecular phylogeny were used to incorporate evolutionary relatedness in a test for covariation of 
body size and geographic range with long term (1 983 - 201 0) local scale population variation of fishes in West Fork White 
River (Indiana, USA). The Bayesian model incorporating phylogenetic uncertainty and correlated predictors indicated that 
neither body size nor geographic range explained significant variation in population fluctuations over a 28 year period. 
Phylogenetic signal data indicated that body size and geographic range were less similar among taxa than expected if trait 
evolution followed a purely random walk. We interpret this as evidence that local scale population variation may be 
influenced less by species-level traits such as body size or geographic range, and instead may be influenced more strongly 
by a taxon's local scale habitat and biotic assemblages. 
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Introduction 

Attributing stream fish assemblage dynamics to random or 
deterministic factors is a long standing theme of community 
ecology [1,2]. A current paradigm is that assemblages are highly 
organized by a variety of abiotic and biotic variables dictated by 
geographic and evolutionary scale [3,4]. Specifically, local 
assemblage variation is linked to local scale factors such as 
predation [5], competition [6], habitat quality [7], and regional 
scale factors such as watershed land use type and history [8], 
stream size [9], and geologic history [10]. Unexplained assemblage 
variation is typically attributed to random noise or otiier untested 
mechanisms. Ultimately, however, assemblage patterns or char- 
acteristics are an emergent product of variation at the population 
level [11]. 

In addition to biotic and abiotic scale dependent factors, body 
size and geographic range are not necessarily independent of 
assemblage variation [12]. An inverse relationship between body 
size and abundance is expected as a function of energetic 
constraints [13] in both terrestrial [14] and aquatic [15] 
assemblages/ecosystems. Furthermore, macroecological studies 
have demonstrated a relationship between body size and 
geographic range [16,17]. The expectation is that larger sized 



individuals are more capable of long range movements and thus, 
exhibit increased range sizes. 

However, the utility of body size and geographic range as model 
predictors to describe long term population dynamics is under- 
studied. Conceptually, small bodied species are expected to exhibit 
greater population variation as a result of higher intrinsic rates of 
increase ?■ [17]. Similarly, species with larger geographic ranges are 
expected to be generahsts for environmental niches [18] and more 
likely to exhibit stable populations. 

However, there are complications with testing the relationship 
between population variation and traits such as body size and 
geographic range. Traits are not independendy distributed across 
species, due to varying lengths of shared evolutionary history 
among related species. Thus, comparative analyses account for the 
expected covariance structure across species, based on hypothe- 
sized evolutionary relationships. Testing for phylogenetic signal 
(e.g. Blomberg's K [19]) provides predictable patterns concordant 
with expected levels of evolutionary covariance (Brownian motion 
model of an evolutionary random walk), or alternatively, 
covariances may be lower (indicating more diverging paths, or 
convergence of unrelated species) or higher (indicating more 
conserved traits). Furthermore, interpretation of phylogenetic 



PLOS ONE I www.plosone.org 



1 



April 2014 I Volume 9 | Issue 4 | e93522 



Fish Populations Do Not Vary with Size or Range 



Table 1. Species included In analysis with descriptions of CV (long term population variation among sites), maximum body size 
(cm), and geographic range (km^). 



Taxa 


CV (min - max) 


Maximum body size (cm) 


Geographic range (km^) 


Moxostoma spp. 


0.40 - 1.10 


78 


1787287 


Hypentelium nigricans 


0.52 - 0.96 


61 


1581169 


Catostomus commersonii 


0.72 - 1.39 


64 


8850545 


Minytrema melanops 


0.57 - 1 .04 


50 


1739172 


Carpiodes cyprinus 


0.71 - 1.73 


66 


2364892 


Carpiodes velifer 


3.26 - 4.80 


50 


1110876 


Cyprinus carpio 


0.66 - 1 .24 


120 


8850545 


Notropis rubellus 


2.00 - 4.80 


9 


1697659 


Cyprinella whipplei 


1.96 - 3.31 


16 


497304 


Cyprinella spiloptera 


1.02 - 3.32 


12 


1516050 


Notropis stramineus 


1 .39 - 4.80 


8.1 


3024895 


Notropis photogenis 


1.11 - 2.09 


14 


481459 


Pimephales notatus 


1.33 - 2.75 


11 


2784561 


Luxitus ctirysoceptialus 


0.70 - 2.94 


18 


1200793 


Lythrurus umbratilis 


1.59 - 4.80 


8.6 


1260469 


Ictaturus punctatus 


1 .24 - 4.69 


127 


4899489 


Ameiurus natalis 


1.00 - 3.09 


47 


3623129 


Dorosoma cepedianum 


1.29 - 3.17 


52 


4559801 


Ambiopiites rupestris 


0.40 - 0.86 


43 


2204814 


Pomoxis nigromaculatus 


1.00 - 2.11 


49 


3196197 


Pomoxis annularis 


1.15-1.84 


53 


2985169 


Micropterus satmoides 


0.60 ~ 1 .28 


97 


3468067 


Micropterus dolomieu 


0.47 - 0.97 


69 


1817217 


Lepomis gibbosus 


3.16 - 4.80 


40 


1980638 


Lepomis microloptius 


2.09 - 4.69 


25 


1523148 


Lepomis megalotis 


0.71 - 0.90 


24 


2693156 


Lepomis cyanellus 


0.78 - 1 .00 


31 


3849721 


Lepomis macroctiirus 


0.64 -1.19 


41 


3574355 


Percina caprodes 


0.82 - 1.29 


18 


3911619 


Percina maculata 


1.11 - 2.88 


11 


2035511 


Etheostoma nigrum 


2.06 - 4.81 


7.2 


3713135 


Etheostoma btennioides 


1.26 - 4.21 


17 


818767 



doi:l 0.1 371 /journal.pone.0093522.t001 



signal values can facilitate conclusions regarding broad evolution- 
ary process of trait convergence or divergence [20] . 

A second complication involves quantitative issues associated 
with incorporating phylogeny into models which describe varia- 
tion among taxa and the presence of coUinearity among 
predictors. Body size and geographic range are correlated [12], 
yet, the relationship between these predictors and variation in 
abundance is of great interest [21,22]. Incorporation of phylogeny 
into a model describing variation in abundance while accounting 
for coUinearity among predictors is problematic with generalized 
least squares methods. However, Bayesian inference is an 
alternative statistical methodology which has been shown to result 
in more precise parameter estimates in phylogenetic models while 
accounting for coUinearity among predictors [23,24,25]. 

The primary objective of this study was to test if body size and 
geographic range influence long term variation in local scale 
stream fish species abundance. Our secondary objective was to 
evaluate the phylogenetic signal of body size and geographic range 



associated with the stream fishes represented in our study. We 
hypothesized that body size and geographic range are negatively 
related to increased variation in long term population dynamics. 
We expected that taxa with small bodies and small geographic 
ranges would exhibit greater temporal variation in abundance as a 
result of energetic constraints, r vs. K selection mode, and smaU 
environmental niche. 

Materials and Methods 

Field collection 

Fish were sampled yearly at six sites from 1983 to 2010 in the 
West Fork White River in East-Central Indiana (Indiana 
Department of Natural Resources Permit -JCD ^ 10-0098; see 
Table 1). Fish were coUected foUowing Simon and Dufour [26] 
and the Ohio Environmental Protection Agency for assessment of 
streams in the Eastern Corn Belt Plains ecoregion (Ohio 
Environmental Protection Agency (OEPA)) in accordance with 
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BS GR 




Moxostoma spp. 
Hypentelium nigricans 
Catostomus commersonii 
IVIinytrema melanops 
Carpiodes cyprinus 
Carpiodes velifer 
Cyprinus carpio 
Notropis rubellus 
Cyprinella whipplei 
Cyprineila spiloptera 
Notropis stramineus 
Notropis photogenis 
Pimephales notatus 
Luxilus chrysocepiialus 
Lythrurus umbratilis 
ictalurus punctatus 
Ameiurus natalis 
Dorosoma cepedianum 
Ambloplites rupestris 
Pomoxis nigromaculatus 
Pomoxis annuiaris 
Micropterus salmoides 
Micropterus doiomieu 
Lepomis gibbosus 
Lepomis microlophus 
Lepomis megalotis 
Lepomis cyanellus 
Lepomis macrochirus 
Percina caprodes 
Percina maculata 
Etheostoma nigrum 
Etheostoma blennioides 



Figure 1 . Phylogeny of study taxa with body size (BS) and geograpliic range (GR) categories. Darker bars indicate higher values. See 

Table 1 for raw data values. 

doi:1 0.1 371/journal.pone.0093522.g001 



American Fisheries Society guidelines for the safe and ethical use 
of fishes in research (http://fisheries.org/). Sampling was 
completed at normal pool water levels while turbidity was less 
than 40 Nephelometric Turbidity Units. AU sites were sampled 
with a boat mounted Smith-Root model 5.0 GPP electrofisher 
with a 5000-watt generator. Sampling proceeded on a hnear reach 
for a distance of 15 times the wetted width with a minimum 
distance of 500 m. Fish were collected using a 3 mm stretch mesh 



net and placed into a live well for processing. All fish (see Table 1) 
were identified to species using regional keys [27], counted, and 
released at the site. Voucher specimens curated at the Bureau of 
Water Quality, Muncie, Indiana were also used for species 
identification. All sites were sampled as part of the Bureau of 
Water Quahties long-term fisheries monitoring program in White 
River. 
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Data summary 

Abundance per site was expressed as electrofishing catch per 
1000 km. Body size and geographic range for each taxon were 
estimated using the Fish Traits database and standardized to their 
z-score (fishwild.vt.edu/fishtraits/ [2 7]). Z-scores were calculated 
as follows: 



Where x is the observation, x~ is the mean value of the sample, 
and s is the standard deviation of the sample. The Fish Traits 
database has been concatenated from numerous regional and local 
distribution and life histor)- studies [28] and can be used in large 
taxonomic scale studies [29]. Taxonomic relationships used in 
comparative analyses (Figure 1) were from published molecular 
studies of Catostomidae [30], Cyprinidae [31,32,33], Centrarch- 
idae [34], Percidae [35], and Ictaluridae [36]. Higher order 
relationships (e.g. family) were from Betancur-R et al. [37]. 

Statistical analysis 

Long term variation in species abundance was estimated as the 
coefficient of variation, cv, for each species at each site. 



Where cvij is the coefficient of variation for species i at site j, Sij is 
the standard deviation of species i at site j, and x~;j is the mean 
abundance of species i at site j. Given the setup of the model only 
taxa that were collected at least once at each site over the 
collection period could be included. This resulted in a single 
species by site matrix of cv values (i.e. for species by sites). 

We modeled cvjj as a linear function of body size and 
geographic range incorporating phylogenetic relationships follow- 
ing de VUlemereuil et al. [23]. Here cvj, is modeled as a 
multivariate normal distribution where the mean is a linear 
function of body size, bs„ and geographic range, gr„ and the 
variance-covariance matrix, S, is proportional to the shared 
branch lengths from the root of the tree to the common ancestor of 
each pair of taxa (Figure 2). 



/I, = a + PibSi + fi2grj 

Where /X; is the mean of each species cv from the multivariate 
normal distribution (mnorm), a. is the intercept and represents the 
hypothetical mean cv with a body size and geographic range of 
zero, and fSj and are model coefficients representing the efiect 
of body size and geographic range. 

We used published molecular hypotheses to represent the 
phylogenetic relationship between species and used this single tree 
with an inverse-Wishart prior as a prior for the variance- 
covariance matrix, 2 [23]. We assumed equal branch lengths. 
As caveat to this analysis, if model parameters are identified as 
important the robustness of the model to choice of variance- 
covariance structure could be evaluated by generating and using a 
distribution of random trees [38] as the prior for the variance- 
covariance matrix. 
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Figure 2. Variance-covariance matrix of a generalized phylo- 
genetic tree. Variance is set to the branch length from the root to the 
tip and the covariance is the branch length from the root to the most 
recent common ancestor (adapted from de Viilemereuil et al. 2012). 
doi:1 0.1 371/journai.pone.0093522.g002 

Since body size and geographic range are known to be 
correlated [12] we used a Bayesian Lasso approach to include 
both variables in the model. The Bayesian Lasso is a variable 
selection technique that uses a double-exponential prior on the 
coefficients [24,25]. The Bayesian Lasso wiU puU the weakest 
parameter to 0 thus providing a variable selection method with 
correlated predictors. 

We used Bayesian inference to estimate parameters of the 
model. Bayesian inference is based on Bayes' Theorem: 



pie\x) = 



p{x\e)p{e) 
lP{x\e)P{e)de 



Where P[X\9) is the likelihood function and represents the 
probability of the data, X, given the parameters, 6, P(9) is the prior 
distribution of the parameters, 6, and the denominator is a 
normalizing parameter. 

We used vague (i.e., noninformative) priors for all model 
parameters except the variance-covariance matrix, 2, to specify 
our prior uncertainty about the model parameters. The variance- 
covariance matrix, S, prior was constructed as the inverse of the 
single phylogenetic tree matrix specified above in a Wishart prior. 
We used the freely available JAGS 3.3 program [39] implemented 
in R 2.15.3 [40] using the rjags package [41]. Complete model 
specifications in the JAGS language can be found in Appendix SI 
of the Supporting Information. We ran 3 MCMC chains for a 
total of 125,000 steps, discarding the first 25,000 steps as a burn-in 
period, and thinning every 5 steps. The burn-in period is necessary 
to reduce the efiect of the starting values on the MCMC results 
[42] . Convergence of the MCMC algorithm was assessed using the 
Brooks-Gebnan-Rubin (BGR) scale-reduction factor [43]. The 
BGR factor is the ratio of between chain variability to within chain 
variability. Convergence is obtained when the upper limit of the 
BGR factor is close to 1.00 indicating there is not more variability 
between chains compared to within chains. Values below 1.10 are 
considered acceptable [42] . We additionally performed a posterior 
predictive check to evaluate model fit. This was conducted by 
calculating the posterior mean of the overall coefficient of 
variation for each species at each step in the Markov Chain. 
The 95% credible intervals from the estimated coefficient of 
variation was compared to the mean value for each species. 



PLOS ONE I www.plosone.org 



4 



April 2014 | Volume 9 | Issue 4 | e93522 



Fish Populations Do Not Vary with Size or Range 



Table 2. Parameter estimates from the coefficient of variation 
model. 



overlapped the observed mean value (Figure ,3). Species with high 
observed average cv corresponded with a high credible inteival 
values. 



Effect Median 95% Credible interval 



Intercept 


1.44 


1.19 - 


1.69 


Geographic range 


-0.08 


-0.48 


- 0.13 


Body size 


-0.17 


-0.58 


- 0.04 



Parameter estimates are considered statistically significant when 95% credible 

Intervals do not overlap 0. 

doi:l 0.1 371 /journal.pone.0093522.t002 



Results 

Summary 

The analysis included 48,071 individuals comprised of 32 
species collected from 6 sites along the West Fork White River 
(Muncie, IN, USA) spanning 1983 to 2010 (Table 1). Taxon body 
size range was from 6.5 to 155 cm (mean 35 cm) and geographic 
ranges were from 481,459 to 8,850,545 km^ (mean 
2,831,692 km^). 

Bayesian hierarchical model 

The BGR statistic for all parameters were less than 1.10 
indicating the model converged after 100,000 iterations (33,333 
steps per chain). The 95% credible interval estimates of the 
parameters for body size and geographic range overlapped 0 
(Table 2), indicating there is no credible evidence to support a 
relationship with species coefficient of variation given the 
phylogenetic tree. When modeled separately with a normally 
distributed prior the posterior distribution of the body size and 
geographic range coefficient did not overlap 0. All of the 95% 
credible intervals from the posterior predictive check of the cv 



Phylogenetic signal 

Body size exhibited low phylogenetic signal (A" 0.57; P<0.001; 
Figure 1 ) indicating size distributions among taxa less similar than 
expected. Geographic range also had low phylogenetic signal {K 
0.33; P=0.07; Figure 1). 

Discussion 

Long term variation in stream fish population abundances did 
not covary with body size or geographic range of taxa. This 
finding is contrary to our initial expectations; however, we do not 
interpret this as evidence that White River stream fish assemblages 
are random or stochastic. In a recent study of the same White 
River fish assemblage, Jacquemin and DoU [29] attributed a 
significant portion of the long term variation to differences in 
habitat and niche breadth (measured as association with particular 
substrate types, flow regime, woody debris, submerged vegetation, 
and distribution elevation) and responses to environmental 
variation among species. Specifically, Jacquemin and DoU [29] 
found that species with more general habitat niches showed 
smaller fluctuations in abundance through time. We interpret this 
as evidence that local scale stream fish assemblages are more 
closely aligned with environmental variation as a result of their 
respective niches than other traits such as body size or geographic 
range. However, while long term data provide a robust measure of 
local assemblage variation we suggest expanding spatial and 
taxonomic coverage through the addition of sites in other 
watersheds that may yield different results. 

Ignoring multicoUinearity in model parameters (e.g., body size 
and geographic range) can result in increased standard errors of 



o 
CO 



o 

O 

o 



ik iv 



,_ 5 



(U ^ ^ U) 
*^ o 03 n3 



c= ^ fn >- u OJ 

CD ™fco,-'''-!„,D-n' 



^ 2 3 tn p !s 

CQ i5 
m 



=! a> £-"0 
_ « 2 TO >, 5 

^ (J O ^ O on TO 

CD ^ X -I 



03 

0 05 



— ' O =J ^ 



j= -c 
tn to 5 



cn cn 1- 

03 O 

IP 



2 5 ■ 
o in . 



0 03 flj 



Species 



Figure 3. Results of the coefficient of variation posterior predictive check from the Bayesian hierarchical model. Points are mean 
estimates from the 95% credible intervals, vertical bars are the bounds of the 95% credible intervals, and solid triangles are observed mean coefficient 
of variation values for each species. 
doi:1 0.1 371/journal.pone.0093522.g003 
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the coefHcients which can result in variables being found non- 
significant in traditional analysis. Thus, the relationship between 
variation in abundance with body size and geographic range is 
often conducted independently [16,21,22]. Typically, multicoUi- 
nearity issues are addressed by increasing sample size or removing 
one of the intercorrelated variables. Increasing sample size is often 
not an option, particularly when analyzing long term data sets. 
Further, removal of a variable may not be an option when there is 
strong theoretical justification for including both. This study is the 
first to our knowledge that tests for a relationship between 
variation in abundance with body size and geographic range in the 
same model. The methods used here permit the inclusion of the 
correlated variables and provided a quantitative method of 
determining what variable is more important in driving variation 
in abundance when the correlated variables are considered 
important when tested individually. 

Interestingly, our results for phylogenetic signal (low A values) of 
body size and geographic range implies less similarity among close 
relatives in the assemblage than expected under a Brownian 
model. Low K values are typically attributed to high levels of 
divergence, the opposite of niche conservatism [20] . One potential 
source of influence outside of divergence may also be our use of 
branch lengths in the analysis. Kraft et al. [44] interpret an 
assemblage level of highly 'derived traits' as evidence for habitat 
filtering influence on taxonomic assemblage variation. Further 
study of phylogenetic signal of ecologically relevant traits may 
improve understanding of assembly patterns in freshwater stream 
assemblages. 

We suggest that our results are particularly relevant to 
conser\'ation biology. Rabinowitz [4,'i] and others [46,47] identi- 
fied utility in using life history traits to define rarity and extinction 
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